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ON THE LARGEST COMPONENT OF A RANDOM GRAPH WITH 
A SUBPOWER-LAW DEGREE SEQUENCE IN A 
SUBCRITICAL PHASE 

By B. G. Pittel 

Ohio State University 

A uniformly random graph on n vertices with a fixed degree se- 
quence, obeying a 7 subpower law, is studied. It is shown that, for 
7 > 3, in a subcritical phase with high probability the largest com- 
ponent size does not exceed n 1/ ^ 7+e ", e n = 0(lnlnn/lnn), I/7 being 
the best power for this random graph. This is similar to the best pos- 
sible n 1 /*- 7-1 - 1 bound for a different model of the random graph, one 
with independent vertex degrees, conjectured by Durrett, and proved 
recently by Janson. 

1. Introduction. In a recently published book ([5], Section 1.2), Durrett 
formulated the following conjecture. 

Let p = {pj}j>i be a probability distribution. Let Di,...,D n be i.i.d. 
random variables, each having the distribution p. Consider a graph on the 
vertex set [n] , chosen uniformly at random among all graphs with the degree 
sequence (D\, . . . ,D n ). For such a set of graphs to be nonempty it is nec- 
essary that max Di < n and J2i^i is even. (The first condition holds with 
probability approaching 1 if E[D] < 00, and the second condition holds with 
probability approaching 1/2 if E[D 2 ] < 00, and g.c.d.{j :pj > 0} is odd.) 
Durrett states that at a vicinity of a generic vertex v the random graph 
looks like a tree rooted at v, and the number of direct descendants of every 
descendant of v has a distribution q = {qj}j>o, 
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If so, under the condition 



j 



one should expect that the component containing v, and even the largest 
component, are likely to be small compared to n. Specifically, Durrett con- 
jectured that for the power-law distribution, 

Pj = Cj' 1 , 7 > 3, 

the likely size of the largest component should be of order ra 1 ^ 7-1 ), exactly. 
In other words, the largest component has size of order of the maximum 
vertex degree. Janson [6] has recently proved Durrett's conjecture. 

In this paper we consider a different model of the random graph, in which 
a degree sequence is fixed. There is given a tuple d = d(n) = (d\, . . . , d n ) of 
positive integers d\, . . . ,d n < n such that d\ + ■ ■ ■ + d n is even. We consider a 
sample space Q n< d of all graphs on [n] with the degree sequence d. Introduce 
the empirical degree distribution 

r t \{ie[n]:ck = j} 

p = {pi,...,p n _i}, pj-.= . 

n 

Let q = q(p) be defined by (1.1). Assuming that p obeys a subpower law, 
that is, 

(1.2) Pj<cj~\ l<j<n-l, 

with 7 > 3, we show that Q n ^ is nonempty. We prove that, under the con- 
dition 

(1.3) £jfc< £>0, 

i>i 

the largest component in the graph G nt d, chosen uniformly at random from 
Q n ,d, bas size C n = O p (n 1//7 Inn), that is, C n j (n 1 / 7 Inn) is bounded in prob- 
ability. Similarly to Janson's result for the independent degrees model, the 
power I/7 is the best possible for the fixed-degree-sequence model, since 
among the degree sequences d in question there are those with max^g^ d v 

of the exact order n 1 / 7 . 

That, under the condition equivalent to (1.3), C n /n — ► in probability, 
had already been proved by Molloy and Reed [7, 8]. They also proved that, 
under their form of the condition 

m 

with high probability the random graph G n)< \ has a giant component of size 
0(n), even being able to establish, under additional conditions, the limit of 
that size scaled by n. 
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Following the footsteps of Molloy and Reed, our proof is based on analysis 
of an algorithm that determines the component containing a given vertex. 
We construct a collection of exponential supermartingales in order to prove, 
via the optional sampling theorem, that the random growth of that com- 
ponent follows closely a certain deterministic path. See [1] and [9], where a 
similar approach was used for analysis of the site (bootstrap) and the bond 
percolation on a random regular graph. 

2. Main result and proofs. Let d\,...,d n be positive integers, such that 
d\ + ■ ■ ■ + d n is even. Let Q Ui d denote the sample space of all graphs on the 
vertex set [n] that have the degree sequence d = (d\, . . . ,d n ). Denote by G n ^ 
the random graph which is distributed uniformly on Gn,d- 

In parallel, let MQ n< \ denote the sample space of all multigraphs with the 
degree sequence d. Let us describe the random multigraph MG n ^ suggested 
first by Bollobas [3]. Consider the disjoint sets Si,...,S n of cardinalities 
d±, . . . , d n ; set Si representing vertex i G [n]. (Some people prefer assigning 
di "half-edges" to a vertex i G [n], instead of sets Si, but the difference is 
purely linguistic.) We know that S := Ui'S'i has an even cardinality 2m := 

d\-\ \-d n . Introduce the sample space V n> d of ah (2m — 1)!! = 1 - 3 • • • (2m — 

1) pairings on S. Let P n ,d be the random pairing distributed uniformly 
° n Tn,A- Define MG n ^ as follows: two vertices i,j G [n] are joined by an 
edge iff there are s' € Si, s" G Sj such that {s',s"} is one of the pairs in 
P n . d . Obviously MG n d may well have loops and multiple edges. And it 
is not uniformly distributed on MQ n &. However, conditioned on the event 
A n := {no loops, no multiple edges}, MG n ^ is a simple graph distributed 
uniformly on <7 n) d, hence can be viewed as the random graph G n &. (This 
connection is due to the observation that every G G Q n> d induces the same 
number, d\\ ■ ■ ■ d n \, of pairings in V n ,d-) 

Suppose that d = d(n) is such that 

(2.1) liminfP(^ n ) >0. 

Under (2.1), any asymptotically rare (sure) event for MG„ ]( j is an asymp- 
totically rare (sure) event for G n ( ±- And we will see that the probability 
estimates for the events in MQ n> <\ become quite manageable once translated 
into the language of the space V n> d- 
Introduce 

, 00 , / \ T,ie[n]di(di-1) 

(2.2) v = v(n) = — . 

v can be interpreted as the expected outdegree of a nonroot vertex in a tree 
rooted at a given vertex v, which, heuristically, is how G n ,d looks like in a 
vicinity of v. Let 

Pj = Pj( n ) := ~\i ie N -di=j}\, je[n-l]. 
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Then (2.2) becomes 

v= ^ — ■ : , 

l~,je[n-l] JPj 

which is the ratio of the first two factorial moments of the distribution {pj } . 
We denote the first moment, the average vertex degree, by d = d(n). 
We assume that d = d(n) is such that 

(2.3) limsup j 2 pj < co, lim n~ l j 4 Pj = ®- 

n— >oo . , n— »oo 

je[n-l] je[n-l] 

In fact, we assume a stronger condition, namely that {pj} is a subpower-law 
distribution, that is, 

(2.4) Pj <cj~\ 7 >3. 
In this case, since 

| {t € [n] : = j} | = n Pj < < 1 Vj > j n := [A(j, c)n 1 ^ \ , 

we see that 

\{ie[n]:di=j}\=0, Pj =0Vj> j n . 

In other words, max iG [ n ]dj, the largest vertex degree, is j n , at most. That 
the first condition in (2.3) is met under (2.4) is obvious; the second condition 
holds true, since (2.4) implies 

^ E j 4 p; = °("" 1/3 ); 

j€[n-l] 

see (2.19). 

Lemma. Under the condition (2.3), 

liminf P(^4 n ) > exp(— vjl — ^ 2 /4) > 0, v := limsupzA 

Note. Applied to the degree sequence d = (d, . . . ,d), this lemma yields 
a well-known asymptotic formula for the number of all <i-regular graphs, due 
to Bender and Canfield [2]. 

We prove Lemma in the Appendix. 

Theorem. Let C n denote the size of the largest component (cluster) 
of the random graph G n ^. Under the condition (2.4), for X = A(n) — > oo 
however slowly, 

lim P{C n < An 1 / 7 Inn} = 1, 

n— >oo 
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provided that 
(2.5) 

in short, C n = Op(n 



I' 1 In n) . 



limsupi/ < 1; 



Proof. We will prove the bound by upper-bounding the likely size of 
a component that contains a generic vertex w£ [n]. In view of Lemma, it 
suffices to bound the size of the component of the random multigraph MG n ^ 
that contains vertex v. 

Notice that a subset V of [n] is the vertex set of this component iff for 
every u G V there exist wi, . . . , Wk G [n] such that, for some sq G S v ,s\ G 
Su,!,. . . ,s k G Su, fc ,sjfc+l G S'u, all the pairs {s ,si}, . . . ,{s fc ,s fc+ i} are in P n;d . 
So we may, and will, deal with the corresponding "component" in P n ^ itself. 
We determine this component algorithmically, by adding to a current cluster 
of pairs exactly one new pair {s',s"} G P n ,d, where a point s' is not in the 
current cluster of pairs, but has the same vertex "host" as one of the points 
in those pairs. [We will call them the (currently) active points.] If the point 
s" , the partner of the point s', is hosted by a fresh vertex, u, then u joins 
the current vertex cluster, and the d u — 1 still unexplored points hosted 
by u become active. As a result, the number of active points changes by 
(— 1) + (d u — 1) = d u — 2. If s" happens to be hosted by a vertex from the 
current vertex cluster, then the vertex cluster remains the same, but the 
number of active points decreases by 2. 

Importantly, instead of generating the uniformly random paring P n d in 
advance, we can generate it one pair at a time, as called for by the algorith- 
mic process. Namely, given a total ordering of the points in S, as s' we pick 
the first, say, active point and pair it with a point s", chosen uniformly at 
random among all points not in the pairs. 

Let A(t), Ij(t) denote the total number of the currently active points and 
the number of the currently inactive (not in the current cluster, i.e.) vertices 
after t steps of the algorithm. In particular, 



(2.6) 



A(0) = d, 



7^ (0) = npj -5 jA 



3<jn- 



Introduce 



the total number of inactive points. From the discussion above, 
A(t + 1) + I(t + 1) = A(t) + J(t) - 2, 

so that, by (2.6), 



(2.7) A(t) + I(t) = A(0) + 1(0) -2t = nd- 2t, 
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where 

i<n j<jn 

is the average vertex degree. From (2.7), the process will terminate no later 
than by time t < nd/2. 

Clearly {A(t), {Ij(t)}j<j n }t>o is a Markov chain, and if 



t<T = T v := min{r > : min{A(r), I(t)} = 0}, 



then 

(2.8) P[I j (t+l) = I j (t)-l\F t ] 



A(t) + I(t)-V 
E[I j (t + l)\F t ] = L(t) + (-l 



(2.9) E[A(t + l)\T t ]=A(t) + (-2) 



A(t)+i(t)-r 

A(t) + I(t) - 1 



here P[- | J-f], E[- | J-f] denote the probability and the expectation conditioned 
on {A(t),{Ij(t)}j<j n }. Since T is a stopping time, it follows from (2.8) and 
(2.7) that, for each j < j n , 

t<T, 



Xj(t):=\ n*=o(l-i/H-2r-l))' 

Xj(T), t>T, 

is a martingale, with 

(2.10) nXj(t)] = Xj(0) = 1,(0) = n Pj - 8 hdv . 

We want to show that, for t = 0(n a ), a € (7" 1 , 1 — 7" 1 ), Xj(t) is relatively 
close to Xj(0), with probability very close to 1. (Of course, we focus on a 
close to 7 _1 , since we expect the process to terminate around a time close 
to n 7 .) To this end, first let us prove that the sequence 

Qj(t) := exp[n&Xj(t)/n], t < n a , 

is "almost" a (super)martingale, provided that 

7 _1 + a < 1, 

(2.11) 

2(/3 J -l)=min{o,-a + (7-l)^}. 
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Let t <T. Observe that, for j < j n , 



T=0 



(2.12) =(1-4 (l + 0(j/n)) 



= 1 + 0(jt/n) 

= l + 0(n r ~ 1+a - 1 )^l. 

Consequently, using 

< Ij (t) - Ij (t + 1) < 1, jlj (t) < jlj (0) = j Pj n < dn, 
we obtain 

n^Xjit+l) -n^Xjit) 



n 



nU(i-j/M-2r-i)) 

roi^ x I j (t)j/(nd-2t-l) 

[ } nU(l-i/(nd-2r-l)) 

= O^" 1 ^ + 1) - + 0(j/i(t)n- 2 ) 

= 0(n" 1 ). 

Therefore, as (3j <1, 

= exp^rT 1 ^ + 1) - n-^-Ct))] 

= l + n /3 ^ 1 (X i (t+l)-X i (t)) 

+ 0(7^- 1 )(X i (t + l)-X,(t)) 2 ). 
Since Xj(t) is a martingale, we have 

E[X j (t+l)-X j (t)\F t } = 0. 
Further, by (2.13) and (2.8), 

EKA^t + l)-^)) 2 !^] 

= 0(E[(/,(i + 1) - Ij(f)) 2 \F t ]) + 0({ 3Pj f) 

= o(jp j + UPj) 2 ) = o(jp j ). 
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Consequently, for t <T, and trivially for t>T, 



" -E[Q,-(t + 1) | T t \ = 1 + 0(n^-^j Pj ) 



QM 

= l + 0(n 2 ^^V 7+1 ) 

= l + 0(7^°), 

the third equality and the fourth equality following from (2.4) and the defi- 
nition of (5j in (2.11), respectively. Thus, there exists e n > 0, e n = 0(n~ a ), 
such that 

E[(l + eJ-^QjQ + 1) | T t ] < (1 + en)"*Qj(t), * < « Q 
It makes 

Q j (t):=(l+ £n )- t Q i (t), t<n Q , 

a supermartingale, that differs from Qj(t) by a factor bounded away from 
both zero and infinity. 

Given j < j n , and z > 0, introduce a stopping time Tj(z), the first t < 
n" AT such that 



n A „\ nd — 2T — lJ n 

T = U 



z 



and set = [n Q + 1] if no such t exists. By (2.10), for t < n a A T and 

i < min,- Tj(z), we have 



(2.14) I 3 {t) = (n Pj - 8 hdv ) ( 1 - + OCn^z). 



t-i 

A nd-2r-l / 
Applying the Optional Sampling Theorem to the supermartingale 

= (1 +£„)"* exp[fiA(A>(t)/n - *i(0)/n)], t < n Q , 

the stopping time Tj{z) (Durrett [4], Section 4.7), and using Markov in- 
equality, we have: uniformly for z > 0, and j <5 j n , 

P{7}(*) = [n a + ll} = 0(e- z ). 

Choosing z = xlnre, (x > 7 )i an d introducing 

B„ = (min^(z) = K + llj, 

we obtain then 

(2.15) P(B n ) > 1 - 0(j n e- 2 ) = 1 - 0(n 7 "V* lnn ) = 1 - 0(n-* +7_1 ). 
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Notice that, on the likely event B n , (2.14) holds for all t < n a A T. 
Armed with (2.14), we turn our attention to (2.9) for 

E[A(t + 1) | F t ], t < n a A T and t < mmTj(z). 



j 



By (2.14), for the sum in (2.10) we can write 



£ M - 2)7,(1) = n •£ j(] ~ 2) Vi g (l - 

(2.16) 

+ 0(d 2 ) + o(lnn J] jV-ftj. 
Here <i 2 = 0(n 27 ), and by the definition of /?,-, 

E/« H '< E iV- ft + Ej a 

j<?n j<n Q /(7-l) i<in 

= - a/2 E 7^ + oO"2) 

_ { 0(n 3a /(T- 1 )lnn) + 0(n 3 ^ 1 ), 7 < 7, 
"\0(n Q / 2 )+0(n 37_1 ), 7 >7. 

Or 

(2.17) £jV-A=0( n *T 1 ), 

j°<in 



for a close to 7 . Furthermore, using (2.12), 

nE^-2)p/n(l-^3l) 



(2.18) 



Here, by 

1 — mi < (1 — x) m < 1 — mi + ) x 2 , 

we have 
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for j = 1 we need the lower bound as j(j — 2) |j=i < 0. Therefore 



n 

i<i 



9f \ i/ 2 



(2.19) 

= n^i(j-2)p,+0(n^- 1 ), 

as a + 7 -1 < 1; see (2.11). [We have used the bounds 

j 3 Pj = 0(n max {°'( 4 -^ _1 hnn), 

j<jn 

(2.20) 

fvj = 0(n max ^°'( 5 -^ 1} lnn), 

which easily follow from (2.4).] 

Combining (2.16)-(2.19), we obtain: for i < n a A T, t < min,- 7} (2), 

(2.21) £ j(i " 2)Ij(t) < n £ j(j ~ 2) Pj + Oin 3 ^ 1 Inn), 

3<3n j<jn 

if a is close to 7 _1 . Notice that 



j 

so that 



= - 1), 



limsup^] j(i - 2 )Pj < °> 



which is the Molloy-Reed condition for the subcritical phase. So, by (2.21) 
and the condition 7 > 3, (2.9) implies that 



E[A(t + 1) I F t ] < A(t) - a 

(2.22) 

a := \ limsup(l — u) > 

for all n large enough. 
The rest is short. Set 



t < T An a and t < mmTj(z) 

j 



A(t + l) = A(t)-a 



t > T A n a or t > mmTj(z) 
j 



(2.23) 
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Clearly the extended sequence {^4(t)} satisfies (2.22) for all t. Besides, since 
T = T V is the first time r when 

min{A(r),/(r)} = 0, 

we have 

{n a <T}n jmin25(z) = \n a + 1] J = {n a < T} n B„ 

c{A(K])>o}. 

Furthermore, since the maximum vertex degree is j n at most, 

A(0) = d v <j n , \A(t + l)-A(t)\<j n ; j n = 0(n^ 1 ). 

Also, reading out the conditional distribution P{^4(t + 1) — A(t) = i \ J~t} from 
(2.10), and using (2.20), 

E[(A(t + 1) - A{t)) 2 | F t ] < 4 + - fPj = 0( ? i max {°'( 4 -^ -1 } Inn), 

3<jn 

if t < T An" and i < min^ Tj(z). And the bound holds trivially for the larger 
values of t. Then 

E[exp(n-^~\A(t + l)-A(t)))\JH] 

= l + n^ 1 E[A{t + l)-A(t)\T t } 

+ 0{n-^ 1 E[{A(t+\)-A(t)f\r t ]) 

< 1 - an^ 1 + (2- 7 ) 7 -l} ^ 

< 1 — 6n~ 7 , b < a, 
since 7 > 3. Therefore 

E[exp(n- 7_1 (A(t + l) -^(i))) |^t] <exp(-6n- 7_1 ), 

and then 

E[exp(nr^~ % A(t))] = 0(exp(-t6n- 7_1 )). 

Hence 

P{A(t) > 0} = 0(exp(-t6n- 7_1 )). 
In particular, choosing 

1 In Inn n 

2.24 a = 7 ~ 1 + - + r L , n>0, 

In n in n 
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which certainly satisfies the inequality 7" 1 + a < 1 in (2.11) for n > n(rj), 
we obtain 

P{A([n a ])>0} = O(n- br >). 

By (2.23), we have then 

P{(n a <T v )nB n } = 0(n- br i), 

and combining this estimate with (2.15) we conclude: for any fixed x > 
and r] > 0, 

P{n Q < T v } = P{e V 1 Inn < T v } = 0(n" x+7_1 + n~ br <). 

Of course, a bounded constant factor implicit in the big-Oh notation depends 
on x an d rj. Thus, given K > 0, there exists L = L(K) such that 

P{Lra 7_1 Inn < T„} <n~ K - 1 , ve[n], 

whence 

P < max T v > Ln 1 In n > < n~ K . 
[ve[n] J 

It remains to notice that the component containing the vertex v has size T v 
at most, so that C n , the size of the largest component, is max,, T v , at most. 

Note. If, instead of (2.24), we had set 
i uj(n) 

a = 7 +- , oj(n) — > cxd, u(n) = o(mn), 

Inn 

we would have proved that 

(2.25) P{w(n)n^ 1 < T v } = 0(n- x+7_1 + e"^ (n) ), 

so that T w = O p (n 7 X ). However, the e~ buJ ^ n ) term in (2.25) would not have 
allowed us to deduce that max^ e [ n ] T v = O p (n 7 1 ) as well. □ 



APPENDIX 

Proof of Lemma. Our argument is patterned after Bollobas's proof 
[3] of a similar, but more general, result for maxjdj = 0(1). 

Let X n and Y n denote the total number of loops and the total number of 
pairs of parallel pairs in the random pairing P n ,d- We want to show that, 
for every fixed k and £, 

E[(X n ) k (Y n ) £ ] ~ , n^oo, 
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where (a)b stands for the falling factorial a(a — 1) • • • (a — b + 1). This would 
imply that X n and Y n are asymptotically independent, and Poisson dis- 
tributed, with parameter i//2 and (z//2) 2 , respectively, and the statement 
would follow, since 

P(A n )=P{X n = 0,Y n = 0}. 

Combinatorially, (X n ) k (Y n )e is the total number of samples, with order and 
without replacement, of k loops and of £ pairs of parallel edges from the ran- 
dom pairing P n ,d- Given any such sample, let S^, . . . , Si k+2l be the ordered 
sequence of sets such that Si , j <k, contains the jth loop, and, for 1 < t < £, 
the tth pair of parallel pairs is (s\,S2), (53,54), where si,S2 € <S'j fe+2t _ 1 , 
S3, S4 € Si k+2t . We write 

E[(X n ) k (Y n ) e ] = E 1 + E 2 . 

Here E\ is the expected number of the samples such that i\ 7^ ■ ■ ■ 7^ i k +2e, 
and Ei is the expected number of all other samples, when at least two indices 
among ij, l<j<k + 2£, coincide. Then 

{nd _2k-U-l)\\ _ k rf( d ^ 

1_ — — 1 2 

v ' H^-frk+st s=l 



Explanation. Let a sequence i\ 7^ %% 7^ • • • 7^ ik+21 be given. From each 
set Si j we choose two points, in risii (2 s ) wavs overall. We pair two points 
from each S 1 ^. , j < k, thus forming k loops. For each t £ [1,£], we match two 
chosen points in Sj. + 2t-i with two chosen points in S k+ 2t, in 2 l ways overall, 
and then divide by 2 e to account for irrelevance of the order in which every 
two sets, S k+ 2t-i and S k+2 t, appear in the sequence S ik+1 ,. . . , S ik+2l . 

Introduce 

k+2l 

ex= e n 

il,—,ik+2i 8=1 

*=(* + » M ) £ rf(1-> 

h=i2,i3,—,ik+2t s=1 

so Si is a counterpart of E\, with the indices i\, . . . , i k +2i allowed to coincide, 
and S2 is an upper bound of the total sum of terms in Si, but not in E\. 
Clearly then 
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Further 

, \ k+2l / \ k+2l 



so that 



Next 



so that 



2 > , \ k+2£-2 



fed' 



since, by (2.20), 



n" 2 



E^=«- 1 E^i=°(»" 1/3 )- 

» J<Jn 



Therefore 



Ei ~ [ - ) , n->oo. 



Finally 

(nd-1 



(nd-2k-4£-l)U „ ^/ A 



Therefore 



E[(X n ) fc (y n ),] = J E 1 + 0(^ 2 )~ (-) , n->oo. 



□ 
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